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- RAW SEQUENCE LISTING DATE: 03/04/2002 

PATENT APPLICATION: US/09/950,051 TIME: 15:03:28 

Input Set : A:\3495D209.app 

Output Set: N:\CRF3\03042002\I950051.raw 

3 <110> APPLICANT: YERAMIAN, EDOUARD 

5 <120> TITLE OF INVENTION: GENES AND THE PHYSICS OF THE DNA DOUBLE HELIX 

6 FORMULATION OF A PHYSICS-BASED GENE IDENTIFICATION 

7 (PBGI) METHOD: AS INITIO IDENTIFICATION OF GENES IN 

8 EUKARYOTIC GENOMES 

10 <130> PILE REFERENCE: 034 95-0209-00000 

12 <140> CURRENT APPLICATION NUMBER: 09/950,051 

13 <141> CURRENT FILING DATE: 2001-09-12 

15 <150> PRIOR APPLICATION NUMBER: 60/232,146 

16 <151> PRIOR FILING DATE: 2000-09-13 
18 <160> NUMBER OF SEQ ID NOS : 9 
20 <170> SOFTWARE: PatentIn Ver . 2.1 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 213 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Plasmodium falciparum 

27 <4 00> SEQUENCE: 1 

28 atgtgcatac atgttacgtt taatttttat tttgaagata atgattttag tgcgttgaaa 60 

29 gttaaggatg aagaaattgt ttctaagaaa aataatttct ccttttctgc tcttagcaat 120 

30 gattcaaatt ctgtaacaaa aaagtacata gttgatttga ccttactaga taatattata 180 

31 gaatccgtaa gaaataaaag aaatataaaa aga 213 

34 <210> SEQ ID NO: 2 

35 <211> LENGTH: 212 

36 <212> TYPE: DNA 

37 <213> ORGANISM: Plasmodium falciparum 

39 <400> SEQUENCE: 2 

40 agtatatttt tttgaacatc aaattttcgc atcgttggag ctccccaggt gcgttgaaag 60 

41 ttaaggatga agaaattgtt tctaagaaaa ataatttctc cttttctgct cttagcaatg 120 

42 attcaaattc tgtaacaaaa aagtacatag ttgatttgac cttactagat aatattatag 180 
4 3 aatccgaaac caaatacaat tttgcttctg tg 212 
4 6 <210> SEQ ID NO: 3 

47 <211> LENGTH: 225 
4 8 <212> TYPE: DNA 

49 <213> ORGANISM: Plasmodium falciparum 

51 <400> SEQUENCE: 3 

52 atgtatagac gcatacatat tattacattt gtaacgatca atcttttttt cttattatcc 60 

53 ctatcccaca gatatcatga tagcgtccag aatttcttga aggaagaaaa aaataactct 120 

54 gataagttac aagatgatat agatgaggat gaggaaaaat attttgacga ggaaatttta 180 

55 agggaagcca aaaaaaaaag tgaagaatat gataaagacg atga 

58 <210> SEQ ID NO: 4 

59 <211> LENGTH: 225 

60 <212> TYPE: DNA 

61 <213> ORGANISM: Plasmodium falciparum 



225 
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RAW SEQUENCE^ LISTING DATE: 03/04/2002 

PATENT APPLICATION: US/09/950,051 TIME: 15:03:28 

Input Set : A:\34950209.app 

Output Set: N!\CRF3\03042002\I950051.raw 

63 <400> SEQUENCE: 4 

64 aaagaaaaac gcatacatat tattacattt gtaacgatca atcttttttt cttattatcc 60 

65 ctatcccaca gatatcatga tagcgtccag aatttcttga aggaagaaaa aaataactct 120 

66 gataagttac aagatgatat agatgaggat gaggaaaaat attttgacga ggaaatttta 180 

67 agggaagcca aaaaaaaaat aacattcttt tcacacgatt ttgtt 225 

70 <210> SEQ ID NO: 5 

71 <211> LENGTH: 600 

72 <212> TYPE: DNA 

73 <213> ORGANISM: Homo sapiens 

75 <400> SEQUENCE: 5 

76 ataaacattc tttagtccac acatagataa ataaataagg aagcaaatag acacacagaa 60 

77 gagcgggaca gctcctcctc ccgggagaat ttcaattagt aagtgtggaa ggaacaaggc 12 0 

78 agggaggaga atcctcaaca gagccccaca gggaccgtgc gggcgaggcc cccggagggg 180 

79 caccagcact gccgggcaaa cgcctgggca gacgcgggac agctgccaag tctcagacat 24 0 

80 gaccaattac agagggaaac ggcggcaccg cgagggatgg gccgcggccg tgtcacctcc 300 

81 atgccccacg cacactgctc ctgtgggatt cctcccccaa cacgatgccc actctgacca 360 

82 cgaggaaacc tcaagcaagt ccacgtggag gggcattcta caaaacaccc aaccggtcaa 420 

83 ggtcgctgag gccaaggaga gattgggcaa ccgtcacaaa ccagagaagc cgaggagagc 480 

84 tttcagccaa cgccatgtgg ggtcctgagc aggacccacc ggaagttggt gcagctgcct 54 0 

85 aaagaccgtc ctggctgaga agaaacagag cagcgctgct ttctcagagc tgggaaccga 600 

88 <210> SEQ ID NO: 6 

89 <211> LENGTH: 1980 

90 <212> TYPE: DNA 

91 <213> ORGANISM: Homo sapiens 

93 <400> SEQUENCE: 6 

94 acctcgatct cagacttctg gcttccaaaa ccatgagaca cggaatttct gttgtgtgac 60 

95 cagccagttt gtggtactgt ttgtcatggc agcccaagga aaagaataca ttacagcata 120 

96 caaaccatga ctcacattat ctttacttag aacccaaaca aacctctctc cctaagcttt 180 

97 caatcacaga ggcacatgat cttgttcagc agcctagaaa accaaggccc agcggagcca 240 

98 cccgtaggca cccactcccc atagcctggc acacacacac ggcagagcca cccacaggca 300 

99 cccactcctc atagtccagc acacacacgg cagagccacc cgcaggcacc cactccccat 360 

100 agcccggcac acacgtggac catgccaccc tccacgtgcg cctggggagc aaagcagcac 420 

101 agcctgaact gcccctcagc tcttcctcct gagtctaaaa cacgcacatg cgccccaggc 480 

102 caattccaag ttttgtaaac tgagcaacag ctcttgggaa acaaaaacac agctactgtt 540 

103 tattctcctg gagctggctg tacaccccaa caaggaaggg agggcttgct gagcctcctg 600 

104 tctggacaac atgcaccaag gaggagtata aaagccccac aaacccgagc acctcactca 660 

105 ctcgctcacc cactccctcc catctccccc agctcaaccc ccagcacagc agcatccacc 720 

106 atgtccgtct gctccagcga cctgagctac agcagccgcg tctgccttcc tggttcctgt 780 

107 gactcttgct ccgactcctg gcaggtggac gactgcccag agagctgctg cgagcccccc 840 

108 tgctgcgccc ccagctgctg cgccccggcc ccctgcctga gcctggtctg caccccagtg 900 

109 agccgtgtgt ccagcccctg ctgcccagtg acctgcgagc ccagcccctg ccaatcaggc 960 

110 tgcaccagct cctgcacgcc ctcgtgctgc cagcagtcta gctgccagct ggcttgctgt 1020 

111 gcctcctccc cctgccagca ggcctgctgc gtgcccgtct gctgcaagac tgtctgctgc 1080 

112 aagcctgtgt gctgtgtgcc cgtctgctgt ggggattctt catgctgcca gcagtctagc 114 0 

113 tgccagtcag cttgctgcac ctcctccccc tgccagcagg cctgctgtgt gcccatctgc 1200 

114 tgcaagcctg tctgctctgg gatttcctct tcgtgctgcc agcagtctag ctgtgtgagc 1260 

115 tgtgtgtcca gcccctgctg ccaggcggtc tgtgagccca gcccctgcca atcaggctgc 1320 

116 atcagctcct gcacgccctc gtgctgccag cagtctagct gccagccggc ttgctgcacc 1380 

117 tcctcctcct gccagcaggc ctgctgcgtg cccgtctgct gcaagactgt ctgctgcaag 1440 
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118 cctgtgtgct ctgaggattc ctcttcatgc tgccagcagt ctagctgcca gccggcttgc 1500 

119 tgcacctcct ctccctgcca gcaggcttgc tgtgtgcctg tctgctgcaa gcctgtgtgc 1560 

120 tgcaagcctg tcggctctgt gcccatctgc tctggggctt cctctctgtg ctgccagcag 1620 

121 tctagctgcc agccagcttg ctgcacctcc tcccaaagcc agcagggctg ctgcgtgccc 1680 

122 gtctgctgca agcctgtgag ctgtgtgcct gtttgctctg gggcttcctc ttcatgctgc 1740 

123 cagcaatcta gctgccagcc agcttgctgc accacctcct gctgcagacc ctcctcctcc 1800 

124 gtgtccctcc tctgccgccc cgtgtgcagg cccgcctgct gcgtgcccgt cccttcctgc 1860 

125 tgtgctccca cctcctcctg ccaacccagc tgctgccgcc cagcctcctg cgtgtccctc 1920 

126 ctctgacgcc ccgtgtgctc ccgcccagcc tgctgaggcc tccgctcagg tcagaagccc 1980 

129 <210> SEQ ID NO: 7 

130 <211> LENGTH: 1800 

131 <212> TYPE: DNA 

132 <213> ORGANISM: Homo sapiens 

134 <400> SEQUENCE: 7 

135 ggatgagagg gggactcatg gaggaacagc cacgccttga ccctgagatg gccttgcagg 60 

136 gagggtaact gaaaatttac ccactgggga cagttgccta cttactaaaa cagttccagc 120 

137 caccaccgca gcccctggaa ggccatcccc ccagaaaatc ccccaggtct cagcagggcc 180 

138 ttgtccacct gtgccctcca gtgtcgccca tgtcaacctc acctaagagg ggcctgacgc 240 

139 acggtcctgc aggtgcggac tctgggtcct gacagcccat gcggaacctg gtgcccccag 300 
14 0 aggagggcct ggggcagtgc cagttttggg gaatcatgtg catccatcca cccactccat 36 0 

141 gatgctttcg tcctgatcga gtcccttgtc tcccgcgcag gtgcagcagc ccctccctct 420 

142 ccccccgcat tgctgctgaa cgggcagaac cctcgggcgg gcggcacaca gggagggtga 480 
14 3 ccaggcctgg aggctgtagt gcccggaccc caggccagct tcctggaagg tgaccctgca 54 0 

144 gggtgggctc tcccaggtgg gacagtgggt gggacagtcc tggggcctgg agagccccac 600 

145 agcccagggc acggcagcca atgaccaggc tcaggaagac ccaggcatgg aggctgagcc 660 
14 6 gggactgagc cttcctgggc gtggctgtga gttccacctg gtgaccccct ggaggagtta 72 0 
147 ggccactgtc ccccgtgact tctaggttaa gtcactcatt catagaaaca gtcatggcta 780 
14 8 gagagcaatc tgagctcaaa accatgtatc cccaggagca ctacagaaaa agagaatcag 84 0 
14 9 gcgaccaagg ggagtttatt ggggagcagg aggaggtgct gacaggttca agtcgaggcc 900 

150 aagtgacctg gggcagagaa gctgggaggg aggacagggg acccaacagg caggtgggcc 960 

151 cctgctggga ggcaggagct ggggagcttc gaggatggag attcctggga gtatggaggg 102 0 

152 gggggtcacc tcagcacatg ggggccccgt cccaagcggg ggcaacctcc taacccgagt 1080 

153 caggaccagt tggccctggg ggatgtgcac atcagcaact ggactcctgg cctgagcaga 1140 

154 ggcctcagca ggccaggcgg gagcacgcgg ggcggcagag gagggacacg caggaggccg 12 00 

155 ggcggcagca gctggcctgg taggaggagg caggggcaca gcaggaggag atgggcacgc 1260 

156 agcaggcggg cctgcatatg gggcggcaga ggagggacac ggaggaggag ggtctgcagc 1320 

157 aggaggtggt gcagcaagcc ggctgacagc tagactgctg gcagcatgaa gtggaagccc 1380 

158 cagagcagac gggcacacag cagatgggtt tgaagcagac aggcttgcaa cagacaggca 1440 

159 cgtagcagga ctgctggcag ggggaggagg tgcagcaagt cggctggcag ctagaatgct 1500 

160 ggcagcatga agaggaatcc tcagaacagg tgggcacaca gcacacgggc ttgcagcaga 1560 

161 caggcacaca gcaggactgc tggcaggagg aagaggcaca gcaagttggc tggcagctag 1620 

162 actgctggca gcatgaagag gaatccttag agcaggtggg caggcagcac acaggcttgc 1680 
16 3 agcagacggg cacgcagcag gcctgctggc agggggagga ggcgcagcaa gccggctggc 1740 
164 agcacgaggg cgtgcaggag ctggtgcagc ctgattggca ggggctgggc tcacaggccg 1800 

167 <210> SEQ ID NO: 8 

168 <211> LENGTH: 1620 

169 <212> TYPE: DNA 

170 <213> ORGANISM: Plasmodium falciparum 
172 <4 00> SEQOENCE: 8 
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! tatgcacgtg tgtgtgtgta taatatacaa ttcttatttc 

: tgggacattt cgtattttgc atcctaacaa catttgcaca 

i aaaattatat atatatatat atatattata tatacatgta 

; tatttcgttt taaagtacca cattacaaag acacatatgt 

' atctttttca tccgtcaggc atattacaca ttcccttaaa 

i aagcatataa tatatatagt atatataatg tacattcata 

I tacatatata tatatatata tatatatata tgatgtacac 

I tttacaggat ccgtttttgt ttctggagat ttttctattc 

. aatgatctgt taccaaattg aatcttctgt ctataaagaa 

• ttgtttttat tctcgttcat ctgattttct tgtaagtagg 

! agtatgggcg taccaatcgc tttaaggact attaaaatag 

: ggtttacatt ttatatcatt tatattcatg ttattcaaat 

i ctcatataaa cttggtttaa attcttcgga aatatttttg 

; ggacaatatc tagaaagaaa agaaaattaa caaaacgaac 

' aaaatataaa aaaaaaaaaa aattgataaa taaacataaa 

( atatatatta atatatatat atatatatat atatatatat 

I tttaaaaata ttacatggtt tcttttgctt ctgttaactc 



tgtctacata ( 
cttcaaaata aaaaaaaaaa ] 
tttactttta atattcatat ] 
ctacaaggta aaattgctgt : 
aataaaacat aaatataaat : 
cgtataaatg aataaaaata ; 
acctgtcaga caaataatta ' 
caaaaatttc ttgaacctca < 
ttattttata tttatcttga ! 
catagttgta ttgagcctga f 
gaatgatata ttcataacta f 
ttattccttc atttaattca : 
taatcgtttg atactttgta : 



190 tagatacctc 



195 taaagtgatt ai 



agaagatata tatatata' 



tacatcatgt aaggcatcaa aaatgaaatt 
aatttttaat gtctttctac gtaagttaag 
atttatattt ggtccttgcg atattagttc 
ttaagtcaaa atagaagaga aaaaaaatga 
tgtacatata tatatatata tatttatatt 
tta tgggattatc aaatgatatt 
aaa aaaagttttt aattaaatat 
tatatggagc aaaatattat 
aataatcatc tcgattgtta 
tcccattata caacaagaaa attataaaat gtaac 
<210> SEQ ID NO: 9 
<211> LENGTH: 1560 
<212> TYPE: DNA 
<213> ORGANISM: Plasmod; 
<4 00> SEQUENCE: 9 
acttataaat aggttgtata caaatatata ttatttatat 
ttggtaacaa tttctttcca tcatcatagg gaagatattt 
agcctgaaat atttaaaaat aaataaataa atatatatat 
ttttgataat atatatatat atatatatat attttttttt 
tttgctgaca caatgctggt aaacgacgat ttgatttttt 
tatataaata aagcaaaata taaaacgtat atatatatat 
tatttgtgta caagaattac catataaaat aggcttgtaa 
taaatatatg tctcatttga agaatactat atttacttaa 
attgcttcac ttttttttca tataaatgcc tgaaaagggg 
aataataata tatatatata tatatattta ttcttttatt 
actaactccc ataatatatc cttcaacaat gatggaaaaa 
tcgtctaatg ataccaaaaa ctgaaaaaag aaaataaaac 
ttgttattac atatatgtat ataaataaat atatatatat 
atatataaac aaatgatata ttccttttta ttaaaaaaaa 
catattcttg acaattcaaa agttctgcat taaaagaata 
gcatccaatc acgaacaatt ctccacgtta tatcaaaaat 
cttgatactg ttcatgtctt aatcttttat taattatcca 



tgaatttctt 

aatgaaataa 
tttatatatt 
gtttgtggct 
gtgttatata 
cattcttttt 
gataacgatg 
tattttaaag 



ttattatttc 
acaacggatg 
gtgttatttg 
gaagtatact 
ttgcttctta 
gtttggaaac 
taatataatg 
ttctttttaa 
aacccattat 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



tttattttgt taataaattc ( 
actcttatct ttcggacgat : 
atataatata tataattacc : 
tcatttacct ccatatgata ; 
ctggttataa aaatcagccc ; 



ttttatgtaa 
gttagccgct 
attatcatta 
atatatatat 
actccaaatg 



ctctgttaca 
aaaatacata 
attttttttt 

attcaagtgt 
atatatatat 

ttcaggaaca 
atatgtattt 
aagatgcaaa 
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225 acattttcat attaaaagat tcatttatac gaaataagtt taatatctct tcattttcta 1080 

226 atctttcaat tattaaatgt actaatctcc atgcacattc tccatattct gtatgttttc 114 0 

227 taaataaata ttaaaatata tatattttta ttttatatat aaacataaat tcaaaatgaa 1200 

228 actttttaaa catattagac gtacatatgc tatgtttttt aaaatattat atatatatat 1260 

229 tatgaagaat tgttttctta caaaaaaaat ttatatatca tagaacttat tatagaacga 1320 

230 tcacattgtg gtataggtat tgcatatctt gaagattctc taatactaat ttttgtattc 1380 

231 ttatcctcat acaaactttt actttcgata atttttaaat tatcatatac ttttatttca 144 0 

232 ctagaaaaat agcgatagtt aatatatctt ctctttaaaa caaattttgt acatttatac 1500 

233 cacatgacaa caaaatttaa ttaaaaatta atgaaaaaaa aaaaaaaaaa aaaaaaaaaa 1560 
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